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Abstract 

We discuss in detail the asymptotic distribution of sample expectiles. First, we show 
uniform consistency under the assumption of a finite mean. In case of a finite second 
moment, we show that for expectiles other then the mean, only the additional assumption 
of continuity of the distribution function at the expectile implies asymptotic normality, 
otherwise, the limit is non-normal. For a continuous distribution function we show the 
uniform central limit theorem for the expectile process. If, in contrast, the distribution is 
heavy-tailed, and contained in the domain of attraction of a stable law with I < a < 2, 
then we show that the expectile is also asymptotically stable distributed. Our findings 
are illustrated in a simulation section. 

Keywords. M-estimator, expectiles, convergence to stable distributions, asymptotic normal¬ 
ity, uniform central limit theorem 


1 Introduction 


Expectile regression, that is, regression on a parameter that general izes the mean and charac - 
terizes the tail behaviour of a distribution, has been int roduced by Newev and Powell ( Il987 i 


as an alternative to more standard quantile regression; iBreckling and ChambersI (|l988l ) con 


sidered regression based on more general asymmetric M-esti mators. For a recent comparis on 
between quantile and expectile regression and references see ISchulze-Waltruo et al.l (j2014l ) . 
Let y be a random variable with distribution function F and finite mea n E\Y\ < oo. For a 
fixed T S (0,1), the r-expectile = @r(F) of Y has been introduced by 


Newev and Powell 
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(jl987l ) as the minimizer of an asymmetric quadratic loss 


firiF) = argmin£'S'T-(x,y), 

xGR. 


Srix, y) = r/2 [ {{y - x)+)^ - [y^f] + (1 - t)/ 2 [((y - x) f - {y )2], 


( 1 ) 


Apparently, for r = 1/2 one obtains the mean. Alternatively to Sr{x,y) in (ITT), one may use 


other scoring functions for the expectile; these were recently characterized bv iGneitind (|2011 
Theorem 10). 

Compared to quantiles, expectiles require the existence of a first moment and hence lack 
robustness. On the other hand, for any distribution with finite mean, the expectile is unique 
for each r, and the expectile curve is always strictly increasing and continuous. 


More importantly, as a risk measure it has been s 
attractive property of coherence (see 


Bellini et al 


lown recently that expectiles have the 


(j 201 J)), while quantiles suffer from the 


lack of sub additivi t y. In deed, expectiles were shown to be the only coherent, elicitable risk 


measures m 


Ziegell (j2014l h for a disc ussion and comparis on between value at risk (quantiles), 


expectiles and expected shortfall see 


Emmer et al. 

( 20 T 

). Further discussion and application 

given in 

Delbaen 

( 

2013 

) and 

Bellini and Di Bernardino 


( 12013 ). 

In this note we study in detail the statistical, that is, asymptotic properties of the sample 
expectiles. Somewhat surprisingly and in contrast to the mean, for r 7 ^ 1/2 we find that even 
under the assumption of a finite second moment, the sample expectile is only asymptotically 
normal if the distribution function F is continuous at yr{F), otherwise, the limit distribution 
is non-normal. 

First, in Section 12.11 we show uniform consistency under the assumption of a finite mean. 
Next, in Section 12.21 we show that if the distribution function F is continuous at its r- 
expectile //t-(F’), there is an asymptotic linearization of the sample expectile for this r. In 
case of finite second moments, this implies asymptotic normality, but if F is in the domain 
of attraction of a stable law, the sample expectile is also asymptotically stable distributed. 
If F has a jump at yriF), we show in Section 12.31 that also under the assumption of a 
finite second moment, the asymptotic distribution of the sample expectile is non-normal. 
Finally, for a continuous distribution function with second moments, we show the uniform 
central limit theorem for the expectile process. We illustrate our findings in a simulation in 
Section O using the t-distribution with low degrees of freedom as a prototypical example for 
heavy-tailed distributions. Based on an explicit representation of the expectile for discrete 
distributions, we exemplify the nonstandard asymptotic behavior of the empirical expectile 
by a three-point distribution. Proofs are deferr ed to Section SI 
In a recent paper, 


Kratschmer and Zahld ([2013) obtained results on the asymptotics of expec- 
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tiles which are to some extend complementary to our results. Using a non-standard version 
of the functional delta-method allows them to treat both the case of dependent data as well 
as expectiles of parametric estimates of the distribution. However, they only consider the 
case of a finite second moment (they even assume slightly more) and a distribution which is 
continuous at the expectiles, and further do not investigate properties of the expectile process. 


2 Asymptotic properties of sample expectiles 


Newev and Powelll ()l987l i state a number of useful properties of expectiles, mainly for abso¬ 


lutely continuous distributions F. Below we state an extension, and in particular point out 
the assumptions on F which are actually required. Introducing the identihcation function 


Ir{x, y) = T{y - x)l{y>a;} “ (1 “ t) “ y)^{y<x} (2) 

of the expectile, it is well-known that ^riF) can equivalently be dehned as unique solution of 
the first-order condition 


EIr{x,Y)=0, X € M. (3) 

The following identity, obtained by a partial integration, is important for us: 

COO nx 

Ir{x, F) :=EIrix, Y)=t (1 - F{y)) dy - (1 - r) / E{y) dy. (4) 

J X J —oo 

Proposition 1. Let F be a distribution function with finite mean. 

(i) For each r G (0,1) there is a unique solution firiF) to ([I]) or, equivalently, to 

(a) The function /-i.(F) : (0,1) ^ M, r i—)• fj,T-{F), is continuous, strictly increasing, and has 

range {y G M : 0 < F{y) < 1}. 

(in) If F is continuous in a neighborhood of fir{F) for a given r G (0,1), then fi.{F) is 
continuously differentiable in a neighborhood of t with derivative 

r{l-F{fir)) + {l-T)F{fir) 

2.1 Sample expectiles and uniform consistency 

In this section we show strong uniform consistency of sample expectiles. Let Y have distri¬ 
bution function F, with finite first moment Ef\Y\ = E\Y\ < oo, and let ... be i.i.d. 

copies of Y, and let Fn be the empirical distribution function. The empirical r-expectile 

fiT,n — Tt i^^n) 
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can be defined as solution of the equation 


1 " 

Ir{x,Fn) = - '^Ir{x,Yk) = 0. 

n ^^ 


( 5 ) 


k=l 


This type of estimator is often termed Z-estimator, and a large amount of theory is available 
to obtain asymptotic properties for this type of estimators. Alternatively, asymptotic results 
can be derived using the representation as an M-estimator, that is, 


1 . ^ f 

p,r,n = argmin3,gK<5n(a:), Snix) = - Sr{x,Yk) = / Sr{x,y) dFn{y). 

^ k=i 


( 6 ) 


Here, any other scoring function for the expectile (jGneitind . 1201111 could be used instead, they 
all result in the same estimator, the expectile of the empir ical distribution function. 

The measurability of flr,n follows from Theorem (1.9) in iPfanzaell (|l969l l. who studied M- 
estimators under the heading of minimum contrast estimation. More directly, measurability 
follows from the explicit representation of (lr,n in Subsection 13.21 


Theorem 2. Let T, 11,12) ■ ■ ■ be i.i.d. with distribution function F, and assume Ef\Y\ < oo. 
For any ti,Tu G (0,1), ti < Tu, we have 


sup — Lt{F)\ —)• 0 a.S. 

n <r<Tu 


2.2 Asymptotic linearization and convergence to stable distributions 

Let us consider the representation Q of the sample expectile as an M-estimator. Asymptotic 
normality or, more generally, asymptotic linearization, requires that the asymptotic contrast 
function has a second order Taylor expansion at the true parameter. Since \dxSr{x,y)\ = 
\Ir{x,y)\ < c{\x\ + |?/|) for a suitable constant c, we may differentiate the asymptotic contrast 
function 

i/^rix) = ESr{x,Y) = J Sr{x,y)dF{y) (7) 

under the integral sign to obtain 

f,'x{x) = -EIr{x,Y) =:-Ir{x,F). 


( 8 ) 


We see from (jl]) that V’r(^) 

right derivative Tpx~^{x) =r(l — F{x)) -|- (1 — r) E{x) 
left derivative = x{l — F{x—)) -|- (1 — r) E{x—) 

at X, where E{x—) = P(y < x) is the left limit of F at x. For r = 1/2 (i.e. the mean), 
these are always equal, but generally o nly coincide at HriF) if F has no point mass in its 
r-expectile. From Theorems 1 and 10 in lArconesI ((20001) we deduce the following linearization. 
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Theorem 3 (Asymptotic linearization). Let Y,Yi,Y 2 ,... be i.i.d. with distribution function 
F. Assume that < oo and that F is continuous at pr = Pt(F) for a given r € (0,1). 

Let {on} be a sequence of positive numbers which converges to infinity with sup„>]^ < oo, 

such that 

n 

^Y.LAt^r,Y,) = Op{l). (9) 

k=l 

Then 

n 

an(pT,n- hr) = ^ {t( 1 - F(pr)) + (l- t) F(pr)) ^ ^ /r (^r, + Op (1) . (10) 

k=l 

Asymptotic normality 

In case of finite second moments, (l9|) is satisfied with an = \/n by the central limit theorem, 
and we obtain asymptotic normality for a finite number of expectiles. In the following, we 
write —)■ T as a short-hand notation for Yn ^ Y ~ F, where F denotes the distribution 
function of Y. 

Corollary 4. Suppose that EY"^ < oo. Let Ti £ (0,1), i = 1,... ,m be such that F does not 
have a point mass at any of the pra i = I, ■ ■ ■ I'm. Then 

V^(Ari,n hri j ■ ■ ■ t P‘Tm,n h'Tm) ^ -^(O, S) , 

where 

E \Lr. ipT ,Y) Ir. ipr-^Y)] 

^ (r,(l - E{pr,)) + (1 - n) F(pnj) (t /(1 -F(pr^)) + (1 - T,) F(pn^)) 

fori,j = 


Convergence to stable distributions 

A random variable X has an a-stable distribution if its characteristic function is given by 


E [e™^] = 


exp (—|u|" [l — i/3 tan sign(u)]) , a 7 ^ 1, 
exp (—|u| [1 -|- i/3^ sign(n) log |tt|]) , a = 1, 


where 0 < a < 2, /3 G [—1,1]. Assume that Y belongs to th e domain of attraction of an 

IiMtI, Def. 


Embrechts et al 


a-stable distribution (Y G DA(a)) with 0 < a < 2 (see, e.g., 

2.2.7)). This is the case if and only if Y has tail probabilities that satisfy 

P(Y>y) = ^^^^L(y) and P(Y <-y) = ^^^L(y), 


00 , 




yU 


( 12 ) 


where L is slowly varying and c"*", c >0 with c'’'-|-c >0 (jEmbrechts et al .1 . 1 19971 . Th. 2.2.8). 
In the following, we assume 1 < a < 2 to ensure that E\Y\ < 00 . 
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Corollary 5. Let Y,Yi,Y 2 ,... be i.i.d. r.v. with distribution function F E DA{a), where 
1 < a < 2. Assume further that F has no point mass in p-r- Then, 

£ Z 

Li{n) r(l - F(/x^)) + (1 - r) F(/i^) 

Here, Z follows an a-stable distribution, and Li is an appropriate slowly varying function. 


Proof. Since F G DA{a), from (1121) we obtain that 

P{I{pr,Y) >y)=T^ ^l±^L{y) and 

ya 

P{I{pr,Y) <-y) = as y ^ oo. 

_ _ 

Consequently, I{pr,Y) G DA{a), and the general CLT ( Embrechts et ah . 1997 . Th. 2.2.15) 
yields 

J(^t; Y}f) — nEI{pr, Z as n —)• oo, 


where Z follows an a-stable distribution and Li is an appropriate slowly varying function. 
This implies that ([9|) is satisfied, and an application of Theorem [3] together with the general 
CLT yields the statement of the corollary. □ 


Instead of using the assumptions Y G DA{a), suppose more specifically that Y belongs to 
the domain of normal attraction of some a-stable distribution with 1 < a < 2, i.e. Y has tail 
probabilities that satisfy 

y<^p(Y > y) ^ c'^ and y'^P{Y < —y) c~, y ^ oo, (13) 

with c+ -|- c“ > 0 and 1 < a < 2. 


Corollary 6. Let Y,Yi,Y 2 ,... be i.i.d. r.v. with distribution function F that belongs to the 
normal domain of attraction of an a-stable distribution, where 1 < a < 2, that is, satisfies 
m- Assume further that F has no point mass in pr- Then 


where 


Proof. Since 


c {pt 


r(l - F{pr)) + (1 - r) F{pr) ’ 


c = 


2r(a) sin(7ra/2) ^ 
7r(r“c+ -|- (1 — 


fi = 


— (1 — t)‘^c 
r"c+ -|- (1 — t)°‘c~) 


y^P{L{pr,Y) > y) ^ c'^ and y^P{L{pr,Y) < —y) ^ {I — t)°‘ c as y —)• oo. 


this follows from the gen eral CL 
corresponding stable law (jNolanl . 


for distributions in the normal domain of attraction of a 


2015|, p. 22). 


□ 
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2.3 Further asymptotics under finite second moments 

Suppose that Y ^ F with EY^ < oo and VarY >0. In contrast to the mean, asymptotic 
normality of general expectiles as in Corollary 0] actually requires the additional assumption 
that Y has no point mass at /Xt-(F), otherwise, the limit distribution is non-normal, as the 
following result shows. 

Theorem 7. Let Y,Yi,Y 2 ,... be i.i.d. with distribution function F with EY^ < oo. Let 
T G (0,1) and denote Hr = Then 

y/n {hT,n — hr) ^ cTiW lw>o + o'2W lw<o, (14) 


whereW r-. N{0,E[Lr{hT,Y)^]), 

_ 1 _ 1 

r{l-F{nr)) +{l-T)F{^ir)' T{l-F{nr-)) +{l-T)F{^ir-)' 

and F[x—) = P{Y < x) denotes the left limit of F at x. 


We prove Theor em by using empiric al process methods and the argmax continuity theorem 
as presented in IVan der VaartI ()l998l i . Alternatively on e could exploit the convex ity of the 
contrast and modify the assumptions and the proof in iHiort and Pollard ()l993l . Theorem 
2.1) to give an alternative argument. 

In case of a continuous distribution function, we also have convergence of the expectile process. 


Theorem 8. Let T ,be i.i.d. with distribution function E with EY^ < oo. Let 
0 < r; < Tu < 1 and suppose that E is continuous in a neighborhood of [/irpA*r„] • Then the 
sequence of processes 


T I—>■ 


(v^ {hT,n hr)') 


n>V 


r € [ti,Tu], 


(16) 


converges weakly in C[ti,Tu\ to a Gaussian process with continuous sample paths and covari¬ 
ance function given in m- 


Tran et. al 


(|2014l i also show convergence of the expectile process. They argue via convergence 
of an associated quantile process, and therefore require that F has a density, further, they do 
not specify the covariance function of the limit process. 

Theorem [7] shows that process convergence, at least in C[ti,Tu\ or even in 1°°[ti,Tu\, cannot 
be expected if F has a discontinuity in [ti,Tu], since in this case the limit process would be 
discontinuous as well. 
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3 Some Simulations 


3.1 Illustration of convergence to a stable distribution 


As an example for a distribution with finite expectation but infinite variance we consider 
Student’s t-distribution to, 1 < a < 2, with symmetric density 


For Ya 


~ ta, 


fa{x) 


r((a + l)/2) / 

r(a/ 2 )-y/a¥ a J 


X G M. 


lim > y) = lim y'^P{Ya < -y) 

y—>oo y^oo 


r((a + l)/ 2 ) 
r(a/2) 


Accordingly, ta belongs to the domain of normal attraction of some a-stable distribution. To 
compute the theoretical r-expectile, which is the unique solution of 

yr-EY = E [(y - , (17) 

i — r 


one can use the identity 
E [(y„ - y^)+] 


y/aT{{a + l)/2) ( 

^/^T{a — l)r(a/2) \ a 


(1 Paiy-r')') j 


where Ea(-) denotes the distribution function of ta- The limiting behavior of the empirical 
T-expectile then follows directly from Corollary [5l Figure [T] shows the distribution function 
of c {yT,n — yr) (more precisely the empirical distribution function based on 10000 

replications) for sample sizes of 20, 200 and 2000 for several values of r and a. It can 
be observed that the quality of the approximation by the corresponding limiting stable law 
depends on both r and a: the approximation improves for decreasing a (see Figure [T] (a)-(c)) 
and for r approaching the value 0.5 (see Figured] (d)-(f)). 


3.2 Illustration of nonstandard asymptotics under finite second moments 


To illustrate the convergence to a non-normal distribution stated in Theorem [T] we first give 
an explicit formula for the empirical expectile which is interesting in itself. From (|17p . it 
follows directly that the r-expectile satisfies the equivalent conditions 


E\[Y-yrr\ 

E[\Y-yr\] ’ 

(1 - t)E [yi{y + tE [Yl{Y>^lr}] 

{1-t)P{Y <yr)+TPiY > yr) 


(18) 


(19) 










Figure 1: Convergence of the cumulative distribution function (cdf) of the empirical expectile 
to the corresponding limiting stable cdf. 

Upper row; Data follow fa-distribution with different a, t = 0.8 fixed. 

Lower row: Data follow fa-distribution with a = 1.5, different values of r. 
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The subsequent representation follows iBellinil (j2012l i , but formulated for the empirical distri¬ 
bution, and allowing for ties. Let < • • • < T(,^) denote the order statistics of Yi,... ,Yn. 
From dUD, the empirical expectile satisfies 

X/fc ^{Y(^k)<fl-T,n} “b ^ Sfc ^{y(k)>P‘T,n} 

Hence, for fir,n S [Yj;*),y(j+i)), where Y(j) < Yj'j+i), one has 

(1 - '^) Efc=i Y{k) + r ELi+i Efc) 


Defining 


/^r,n — 


Ti : = 


(1 — T)i + T{n — i) 


i = 1 ,... ,n. 


( 20 ) 


( 21 ) 


ELiIEd-^o)!’ 

we have fir,n = Y{i) iff r = r* for f = 1 ,... ,n (and then, (l2T]) is the empirical counterpart 
of (fT8|) '). Note that Tq = 0,t* = 1 , and since fir,n is nondecreasing in r, we obtain that 
T* < tEi, f = l,...,n—1 . As a consequence, 

Ar,n e [y(i),y(i+i)) r e [t*,t*,_i), f = 

Remark. (i) Formulas (1211) and (I20p are especially well-suited for plotting purposes with¬ 
out the need of any numerical root-finding. 


(ii) From (12(1 . jiT-^n is piecewise differentiable in r with 

dfir,n _ i ELi+i Ed -{n-i) EEi Efc) 


dr 


((1 — r)f -|- r(n — f))^ 


for r e (tEtEi). 


If Y has a discrete distribution on 0,1, 2,... (say), an analogous reasoning leads to the fol 
lowing explicit formula for the theoretical expectiles Define 

El=oii - k)P{Y = k) 


Ti := 


f = 0,1,2,.... 


T.k>o\i-k\P{Y = ky 

For r G [EjE+i)! accordingly G [f,i -(- 1), one has 

(1 - E kP{Y = k)+T E,>, kP{Y = k) 


flq- - 


( 22 ) 


(23) 


(1-r)P(y < z)+ TP(y > i) 

Now, assume that Y follows a three-point distribution with P{Y = i) = pi, i = 0,1,2, with 
Po,Pi,P 2 > 0,po+Pi+P2 = 1- Then, from and ([23]), we get Tq = 0,tI = Po/{po+P 2 ),T 2 = 
1 and 


flp — 


I t(pi+ 2 P 2 ) n ^ ^ * 

_ J (l-T)po+'r(pi+P2) ’ 1’ 


(1-t)pi+2tP2 


(1-t)(po+Pi)+tP2 ’ 1 — 


rf < T < 1. 
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(a) r = 0.7, /iT = 0.907 


(b) T = 0.8, /ir = 1 


Figure 2: Density function of the standardized empirical expectile for n = 500 and of the 
corresponding limiting distribution. Data follow a three point distribution in 0,1,2. 

(a) T = 0.7, normal limiting distribution, (b) r = 0.8, non-normal limiting distribution. 


Next, we make the choice po = 4/10,= 5/10,p2 = 1/10. Then, /xq.s = Ij i-e. the 
distribution of Y has a point mass in pr for r = 0.8, but not for other values of r. Figure 
El (a) shows the density of ^/n (/lr,n — IJ-r) (estimated by a nonparametric density estimator 
based on 20000 replications) for sample size 500 and r = 0.7 (hence, pr = 49/54) together with 
the limiting normal distribution given in Corollary SI Figure [2] (b) shows the corresponding 
plot for T = 0.8 together with the limiting non-normal distribution given in Corollary [71 


4 Proofs 


Proof of Proposition [11 Parts (i) and (ii) are from iNewev and Powelll (|l987l l except for the 
general continuity of Pt(.F) in t. From Q we see that Ir{x,F) is a continuous function of 
{t,x). To show continuity of the expectile, first let | r, and let pr = linin Air„(-F) for which 
by monotonicity Pt{F) < Pr- By continuity of Ir{x,F) we have 


0 = lim/.r„(/^r,(-F),F) =Ir{pr,F), 

but since Pt{F) is the unique zero, it follows that Pt{F) = Pr, that is, right-continuity. The 
argument for left-continuity is the same. 

(iii) From ([H) we see that if F is continuous in a neighborhood of x, then /,-(•, T) is 
continuously differentiable at x with derivative —x(l — F{x ))-(! — r) F{x). The conclusion 
follows from the implicit function theorem. □ 
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Proof of Theorem\2[ We start with strong consistency of individual expectiles, that is, 


^ Cl'S. 


(24) 


We may use the repr esentation ([^ of the empirical expectile as a Z-estimator and strengthen 


Van der VaartI (j 19981 . Lemma 5.10) to almost sure convergence. Since x >->• Ir{x, F) is strictly 


decreasing, we have for every e > 0 that 


Irif.lT -e,F) >0> Iriflr + £, F). 


Since i s,Fn) —)• i s,F) a.s. as n ^ oo, we have a.s. that — £,Fn) > 0 > 

+ £,Fn) for large n G N. Since each map x I{x,Fn), n G N, is continuous and has 
exactly one zero /ir,n; this zero must a.s. lie between /ir i £ for large n G N, that is. 


limsup l/lr.n — fi-T{F)\ < e a.s. V e > 0, 

n—>-oo 

showing (fMD . 

Using Proposition [1] (ii) and individual consistency, the classical Glivenco-Cantelli argument 
may be applied. Let d = fir^iF) — finiF), m G N, and choose by continuity t; = tq < 
Ti < ... < Tm = Tu such that = d'niF) + kd/m, k = 1,... ,m. By monotonicity, for 

Tk T 'Tfc+l, 

Ar,n ~ k'riF) < Arj._|.i,n ~ d'Tk+iiF) + ~ d'Tki^)- 


Therefore 

sup (ilr,n - fJ'riF)) < maX lArfe.n “ (-?^) I + d/m. 

Ti<T<ru 0<k<m 

Similarly, 

sup (fJ.T{F)-[ir,n)< max fJ^Tk{F)\+d/m. 

n<T<Tu 0<k<m 

Since 


sup I Ar,n - Mr (i^) I = max ( SUp (Ar,n “ Mr (-?^)) , SUp (Mt (-?^) “ Ar,n) ) , 

Tl<T<Tu ^Tl<T<Tu n<T<Tu ^ 


we have for any m G N that 


limsup sup I Mr,n — Mr (.^) I < limsup max — d-Tk{P)\F d/m = d/m a.s. 


n Tl<T<Tu 


n 0<k<m 


□ 


We shall derive Theorem [3] from Theorems 1 and 10 in lArconesI (|2000|). For convenience, we 
state a version of these results, tailored to our needs. 
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Theorem [Theorems 1 and 10 in lArconea i200u ) ] Let Y, Yi, Y2) • • • be i.i.d. with distribution 
function F. Let s' : —)• M be a function such that g{-,'&) : M —)• M is measurable for each 
i? e M. Let dn be a sequence of r.v.’s satisfying 


n 


= inf n 


k=l 


k=l 


Suppose that: 

(A.l) dn A do, do e M. 

(A.2) There is a positive constant V such that 

Elg(Y, d) - g(Y, t?o)] = V(d - dof + o(|i? - i?oP), 


as d —>■ do- 

(A.3) Let : M —)• M and let {an} be a sequence of positive numbers which converges to 
infinity with sup„>]^ < 00 such that 


n-^^^iYj)-E[ipiY)]] =Op(l). 
i=i 


(A.4) There is a function ^ : M —)• M with £'|C(Y)| < 00 such that 


lim E 

s^o 


sup 

|i?|<5 


r{Y,d) - d^ C{y)\ 


= 0 , 


where r(y, d) = g{y, do Ed)- g{y, do) - d(p{y). 


Then, 


.{L + ^ E*’(U) - my)] I «. 


(25) 


i=i 


Proof of Theorem O We verify the conditions of the above theorem for do = firiF), g{y, d) = 
ST{d,y), (p{y) = -Ir{gT{F),y) and 

T 1 — T 

Ciu) = TJ ^{y>tJ.T} “I 5 ^{y<Air} ■ 


(Al) follows from Theorem [2j 

(A2) follows from ©,(181), the assumption of continuity of F at frr{F), and Taylor’s theorem, 
which holds under the minimal assumption of an existing second derivative. 
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(A3) is dH). 

Finally, for (A4) we compute that for x > 0, 


y) y) a -) y') 

_ T ( \2 1 I 2-1 

~ 2\y^^'r ^{^^T<y<^^T+x} 2 ^ ^{y>^^T} 

+ 2 (y ~ y-r ~ x) ^{ij,T<y<fiT+x} H 2 ^ ^{?/<A‘t} 

and similarly for x < 0. Therefore for some c > 0 we may estimate 

\Sr{yT + X,y) - S{Hr,y) +xIr{fiT,y) - x'^C{y)\ <c{y -fir- xf 'i-{y^-\x\<y<y^ + \x\} 

<CX '^{y,r-\x\<y<y,r + \x\} 1 


and therefore 


E 


sup 

|a:|<5 


\Sr{yir + X, y) - Sr{yT,Y) + xIr{iiT,Y) - x‘^C{Y)\ 


<c P (//t — (5 < y < /Tt- + <^) —>■ 0, (5 —)■ 0, 


since Y does not have a point mass at ^r- D 

Proof of Theoreml^ We start by establishing Lipschitz continuity of Sr{x,y) as a function 
of X with square-integrable Lipschitz constant. Since dxSr{x,y) = —Ir{x,y), we have for 
Xl,X2 G Bsinr) 


\Sr{xi,y) - Sr{x 2 ,y)\ < cm{y)\xi - X 2 \, rn{y) := sup \lr{x,y)\. 

xeBsitlr) 


Then, the inequality 


(26) 


'ixi{y) < sup |x — y| < sup |x| + \y\ 
xeBs{y,T) xeBsi^r) 

yields E[m{Y)‘^] < oo if EY'^ < oo, that is, the Lipschitz constant has finite second moment. 
Next, the asymptotic contrast in ([7]) is continuously differentiable with left and right deriva¬ 
tives in fij- given in ([8]). From Taylors formula, we obtain 

'Iprix) - 'fprif^r) = (x - {yLr) /‘^ + o(|x - /i^P), 

'Iprix) - V’r(^r) = {x - yrf'lpl~{yr)/‘^ + o(|x - /i^P), 

where ipx^{yLr) are right/left second derivatives. 


X > fir, 


X < yLr, 


(27) 
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Therefore, the assumptions of Theorem 5.52 in IVan der VaartI (jl998l i are satisfied with a = 2 
and /3 = 1 (see the argument in Corollary 5.53, that the Lipschitz property (|26p implies the 
concentration inequality), and we obtain the y/n-iate of convergence: 

^/^ (/^T,n /^t) — 

To obt ain the asympt o tic di stribution, we apply the argmax-continuity theorem, Corollary 
5.58 in IVan der VaartI (199a). To this end, for a measurable function / with Ef'^{Y) < oo, 
denote 

1 " 

Pn/ = -J^/(n), Pf = EF{Y), and G„(/) = (P„ - P)/. 


k=l 


By the Lipschitz property (I26p . from the proof of Lemma 19.31 in IVan der VaartI ()l998l i we 
obtain for any M > 0 that 


sup Gn[Vn {SrifJ-T + h/^/n, •) - Sri^r, •)) + hlriHr, ')] 
\h\<M 


0 (P). 

Therefore, for any M > 0, the difference between the processes 

hi-)- ^/nFn[^/n {SriHr + h/y/n,-) - S'.r(//r,-))], 1^1 < Tf, 

and 

h^n [V’r(AiT + h/Vre) - •^.^(/Ur))] - hGnlrifJ^r, ■), \h\ < M, 

tends to 0 in probability in sup-norm. Using (EZl), the second process converges to the 
Gaussian process 


In In 

h e--— h lh>o + 7^— h l/i<o — hW, 

Zai 2(72 


(28) 


where W is normally distributed as in the theorem, hence so does the first. Prom the argmax 
- continuity theorem, we obtain weak convergence of the minimizers ^/n {fiT,n ~ A^t) to the 
minimizer of the limit process. Now, a parabola h i—)• —hW -|- h?I{2a) for some (7 > 0 is 
minimized at h = aW, yielding the negative value —aW‘^12. Therefore, the minimizer of 
(f28|) is at h = (TiVU for IT >0 and at h = (72IT for IT < 0, which gives the statement of the 
theorem. □ 


Van der Vaart 


(119951. Th e orem 1), which gives asymp- 
( 2008 . Theorem 13.4), which 


Kosorok 


Proof of Theorem [8l We shall apply 
totic normality of functional Z-estimators; see also 
additionally implies validity of the bootstrap. First, Theorem [2] gives the uniform consis¬ 
tency. Given zz € C[ti,Tu\ C /°°[r;,r„], the functions r i—;■ Ir{v{T),F) and r i—)■ Ir{r'{T), Fn) 
are also in C [r;, r^], and 


T I y F^ — 0, T I >■ Pn) — 0. 
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Next, we check the conditions (2), (3) and (4) in IVan der VaartI (jl995l h Suppose that v G 
C [t; , r„] is such that F is continuous on the image of v (this is true by our assumption if 
Wv — is small enough). Then we apply the mean value theorem for each r G [ti,Tu] to 

obtain 

\It{i^{t),F) - Ir{^r,F) + [r(l - F{Hr)) + (1 “ 'r)F{^ir)] {u{t) - fir)\ 

- F{Hr)\ - Hr\, 

where is between z/(r) and fir- Since F is uniformly continuous in a compact neighborhood 
of [ti,Tu], we obtain 

sup \lr{li{T),F)-Ir{fir,F) + [T{l-F{fir))+{l-T)F{fir)] {iy{T)-fir)\ = o{\\iy - fi\\[r, ,ru]) , 

relTi,Tu] 

(29) 


showing Frechet differentiability, that is, (4) in IVan der VaartI (jl995l b Note that the deriva¬ 
tive, multiplication with the function r “[''“(l “ -^(/^r)) + (1 ~ "?■) -^(/^r)] is continuously 
invertible. 

Since by Proposition [H (iii), firiF) is continuously differentiable in r, we have for an appro¬ 
priate constant c > 0 that 

\lri{fiTi{F),y) - Ir2{fiT2{F),y)\ <c\y\\Tl-T2\, Tl,T2 & [ti,Tu], 2/ G M, (30) 


so that 

is a Donsker clas s of fu nctions, see 
in IVan der VaartI (|l99^ _ 


(.fr(/^r(-P), •)).rGh,T„] 

1 1 998l . Example 19.7.), taking care of (2) 


Van der Vaart 


Finally, to show (3) in IVan der VaartI (jl995l b we choose 6n i 0, and estimate in the first step 


sup sup ^/n\Ir{v{T),Fn) - Ir{l^{T),F) - [lr{flr,Fn) - Ir{yT,F)]\ 

TG[Ti,T„] 

< sup sup ^/n\Ir{fir + X,Fn) - Ir{yT + X,F) - [lr{fiT,Fn) - Ir{yT,F)]\. 
|a;|<< 5 n rG[r;,r„] 


(32) 


Now, for a constant C > 0, y G M, r, ri, G [T;,r„], |x|, |xi| < 1, 

+xi,y) - Iri{fiTi,y) - [/r(lir + a;,y) - Ir{fiT,y)] \ < C\y\{\xi - x\ + \ti - r|), 

\lr{fir+X,y) - Ir{yT,y) < C\y\ |x|. 


Therefore, 


sup sup E(^Ir[fiT + X,Y^ — Ir[fiT,y))‘^ ^ C EY"^ 5 r, 

|x|<5„ rG[Ti,Tu] 
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and each 


T — 

'J n, — 


|(x,r) i-> lr{}ir + x,y) - Ir{fiT,y), ^1 < Sn, T & [r/,r„]| 


is a Lipschitz-class of functions. Therefore we may estimate (I32p by t he bracketing i n tegral 


Van der VaartI (|l998l . 


J[] ((5n, T'n, L 2 (T)) and an additional sequence converging to zero, by 
Lemma 19.34 and Example 19.7), which together —)• 0 as n —)■ oo and 6n i 0. 

Next, we show that weak convergence is actually in C\ti,Tu\. The expectile processes (HID 
have continuous sample paths. As for the limiting Gaussian process, it suffices to show conti¬ 
nuity of the sample paths of the limit Gaussian process of the empirical process corresponding 
to the function class m, since the inverse o f the Frechet derivativ e in (1291) is simply multi¬ 
plication by a fixed continuous function. Bv IVan der VaartI (119981 . Lemma 18.15), the limit 
process can be constructed to have continuous sample paths w.r.t. its standard deviation semi¬ 
metric. In order to check that continuity also holds w.r.t. the ordinary distance on [ti,Tu], we 
show that 


< C'^{T2-Tlf, Tj G [fJ-ri, , j = 1,2 (33) 

for some C > 0. But this follows immediately from (|30p upon squaring and integrating. This 
concludes the proof of the theorem. □ 
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